# Low-resource speech processing
Whisper Small Ta
Apache-2.0
This model is a speech recognition model fine-tuned on the Tamil Common Voice 17.0 dataset based on OpenAI's Whisper Small, with a Word Error Rate (WER) of 43.23%.
Speech Recognition
Transformers Other

W
navin-kumar-j
38
1
Whisper Fa Tinyyy
MIT
Persian automatic speech recognition model fine-tuned based on OpenAI Whisper-tiny, trained on the common_voice_11_0 dataset
Speech Recognition
Transformers Other

W
hackergeek98
55
2
Arabic Alphabet Speech Classification
This is a transformers model for Arabic alphabet speech classification, capable of recognizing and classifying the pronunciation of Arabic letters.
Audio Classification
Transformers

A
HamzaSidhu786
60
1
Whisper Large V3 Taiwanese Hakka
A Whisper-large-v3 fine-tuned model for Taiwanese Hakka speech recognition, supporting multiple Hakka dialects
Speech Recognition
Transformers Other

W
formospeech
41
5
Vegam Whisper Medium Ml
MIT
This is a version of thennal/whisper-medium-ml converted to the CTranslate2 model format for Malayalam speech recognition
Speech Recognition Other
V
smcproject
83
5
Exp W2v2t Th Hubert S533
Apache-2.0
A Thai speech recognition model fine-tuned from facebook/hubert-large-ll60k, trained on data from Common Voice 7.0
Speech Recognition
Transformers Other

E
jonatasgrosman
19
0
Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V3
Apache-2.0
Automatic speech recognition model based on wav2vec2-large-xlsr-53, fine-tuned on the GARY109/AI_LIGHT_DANCE dataset
Speech Recognition
Transformers

A
gary109
191
0
Asr Wav2vec2 Dvoice Amharic
Apache-2.0
This is an automatic speech recognition model for Amharic, trained using wav2vec 2.0 architecture with CTC/Attention mechanism
Speech Recognition Other
A
speechbrain
96
9
FYP ARABIZI
Apache-2.0
This model is a speech recognition model fine-tuned on an unknown dataset based on facebook/wav2vec2-large-xlsr-53, supporting recognition of Arabic dialects (Arabizi).
Speech Recognition
Transformers

F
ali-issa
33
1
Wav2vec2 Large 100h Lv60 Self
Apache-2.0
Wav2Vec2-Large-100h-Lv60 is a large model pre-trained and fine-tuned on 100 hours of Libri-Light and Librispeech speech data, trained with self-training objectives, suitable for speech recognition tasks with 16kHz sampling rate.
Speech Recognition
Transformers English

W
Splend1dchan
17
0
Wav2vec2 Common Voice Tr Demo
Apache-2.0
This model is a speech recognition model fine-tuned on the Turkish Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers Other

W
YiTian
30
0
Wav2vec2 Large Xlsr Turkish
Apache-2.0
A speech recognition model fine-tuned on the Turkish Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Other
W
m3hrdadfi
384
7
Wav2vec2 Large Xlsr Arabic Demo Colab
Apache-2.0
An Arabic speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers

W
Wiam
22
0
Wav2vec2 Large Xlsr 53 Hungarian
Apache-2.0
This is a Hungarian automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.
Speech Recognition Other
W
anton-l
17
0
Fb Youtube Vi Large
Apache-2.0
This model is an automatic speech recognition model fine-tuned on Vietnamese YouTube informal audio datasets, based on facebook/wav2vec2-large-xlsr-53.
Speech Recognition
Transformers

F
phongdtd
31
1
Wavlm VLSP Vi
A Vietnamese automatic speech recognition model fine-tuned on the PHONGDTD/VINDATAVLSP - NA dataset based on microsoft/wavlm-base-plus
Speech Recognition
Transformers

W
phongdtd
21
0
Wav2vec2 Large Xlsr Tamil Commonvoice
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice Tamil dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers

W
nikhil6041
43
0
Wav2vec2 Large Xlsr 53 Sw
Apache-2.0
Swahili automatic speech recognition model fine-tuned on XLSR-53 large model, supports 16kHz sampling rate audio input
Speech Recognition Other
W
alokmatta
158
2
W2v Timit Ft 4001
A speech recognition model based on Wav2Vec 2.0 architecture, fine-tuned on the TIMIT dataset, suitable for English speech-to-text tasks
Speech Recognition
Transformers

W
devin132
22
0
Wav2vec2 Large Xlsr Finnish
Apache-2.0
This is an automatic speech recognition model fine-tuned on Finnish based on facebook/wav2vec2-large-xlsr-53, trained using the Common Voice dataset.
Speech Recognition Other
W
birgermoell
22
0
Unispeech 1350 En 168 Es Ft 1h
UniSpeech is a unified speech representation learning model that combines labeled and unlabeled data for pre-training, specifically fine-tuned for Spanish phoneme recognition.
Speech Recognition
Transformers Spanish

U
microsoft
19
0
Wav2vec2 Base 10k Voxpopuli Ft Cs
A speech recognition model based on Facebook's Wav2Vec2 architecture, pre-trained with 10K unlabeled Czech data from the VoxPopuli corpus and fine-tuned on Czech transcription data.
Speech Recognition
Transformers Other

W
facebook
226
0
Wav2vec2 Large Xlsr Upper Sorbian Mixed
Apache-2.0
This is an Upper Sorbian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on data from the Common Voice dataset and online Sorbian courses.
Speech Recognition Other
W
jimregan
25
0
Wav2vec2 Base Timit Demo Colab
Apache-2.0
A speech recognition model fine-tuned on the common_voice dataset based on anas/wav2vec2-large-xlsr-arabic
Speech Recognition
Transformers

W
nadaAlnada
16
0
Xls R Ab Test
This model is an automatic speech recognition model fine-tuned on the Common Voice 7.0 AB dataset, based on the XLS-R dummy architecture
Speech Recognition
Transformers Other

X
cahya
20
0
Wav2vec2 Xls R 300m W2V2 XLSR 300M YAKUT SMALL
Apache-2.0
This is a speech recognition model fine-tuned on the Yakut (Sakha) language dataset based on the facebook/wav2vec2-xls-r-300m model
Speech Recognition
Transformers Other

W
emre
90
0
Arabic Speech Recognition
Apache-2.0
An Arabic automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampling rate audio input
Speech Recognition Arabic
A
mohammed
37
2
Wav2vec2 Xls R 300m Lg
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the COMMON_VOICE - LG dataset, supporting automatic speech recognition tasks for Luganda (lg).
Speech Recognition
Transformers Other

W
samitizerxu
22
0
Sew D Small 100k Ft Timit
Apache-2.0
An automatic speech recognition model fine-tuned on the TIMIT_ASR dataset based on asapp/sew-d-small-100k
Speech Recognition
Transformers

S
patrickvonplaten
18
0
Wav2vec2 Large Xls R 300m My Hindi Home Colab
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on a general speech dataset, suitable for speech recognition tasks.
Speech Recognition
Transformers

W
nimrah
16
0
Wav2vec2 Large Xls Ar
Apache-2.0
An Arabic automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, achieving a WER of 52% on the Common Voice Arabic dataset.
Speech Recognition
Transformers Arabic

W
mohamed1ai
30
1
Wav2vec2 Base 10k Voxpopuli Ft Sk
Pre-trained on 10K hours of unlabeled VoxPopuli corpus data and fine-tuned on Slovak transcription data
Speech Recognition
Transformers Other

W
facebook
39
1
Wav2vec2 Base 10k 8khz Pt Cv7 2
Apache-2.0
This model is a Portuguese automatic speech recognition model based on the wav2vec2 architecture, fine-tuned on the Common Voice 7 dataset, supporting 8kHz sample rate audio input.
Speech Recognition
Transformers Other

W
lgris
24
2
Wav2vec2 Large Xlsr Turkish Demo Colab
Apache-2.0
This model is a fine-tuned Turkish speech recognition model based on facebook/wav2vec2-large-xlsr-53 on the Common Voice dataset
Speech Recognition
Transformers

W
patrickvonplaten
14
2
Wav2vec2 Large Xls R 300m Ab V4
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Abkhazian (ab) dataset based on Facebook's wav2vec2-xls-r-300m model
Speech Recognition
Transformers Other

W
Arxived
16
0
Xls R Ab Test
This is an automatic speech recognition model fine-tuned on the COMMON_VOICE - AB dataset, based on the XLS-R Dummy architecture
Speech Recognition
Transformers Other

X
FitoDS
22
0
Featured Recommended AI Models